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Here we uncover the load and fault-tolerant backbones of the trans-European gas pipeline network. 
Combining topological data with information on inter-country flows, we estimate the global load 
of the network and its tolerance to failures. To do this, we apply two complementary methods 
generalized from the betweenness centrality and the maximum flow. We find that the gas pipeline 
network has grown to satisfy a dual-purpose: on one hand, the major pipelines are crossed by a 
large number of shortest paths thereby increasing the efficiency of the network; on the other hand, 
a non-operational pipeline causes only a minimal impact on network capacity, implying that the 
network is error-tolerant. These findings suggest that the trans-European gas pipeline network is 
robust, i.e., error tolerant to failures of high load links. 
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I. INTRODUCTION 



The world is going through a period when research 
in energy is overarching P, Oil and gas prices arc 
volatile because of geopolitical and financial crises. The 
rate of world-wide energy consumption has been acceler- 
ating, while gas resources are dwindling fast. Concerns 
about national supply and security of energy are on top 
of the political agenda, and global climate changes are 
now believed to be caused by the release of greenhouse 
gases into the atmosphere 

Although physicists have recently made substantial 
progress in the understanding of electrical power-grids [1, 
S, U @, S H, B , surprisingly little attention has been 
paid to the structure of gas pipeline networks. Yet, nat- 
ural gas is often the energy of choice for home heating 
and it is increa sing ly being used instead of oil for trans- 
portation 0, [llf Although renewable energy sources 
offer the best cuts in overall CO2 emissions, the gener- 
ation of electricity from natural gas instead of coal can 
significantly reduce the release of carbon dioxide to the 
atmosphere. As the demand for natural gas rises in Eu- 
rope, it becomes more important to gain insights into 
the global transportation properties of the European gas 
network. Unlike electricity, with virtually instantaneous 
transmission, the time taken for natural gas to cross Eu- 
rope is measured in days. This implies that the coor- 
dination among transport operators is less critical than 
for power grids. Therefore, commercial interests of com- 
peting operators often lead to incomplete or incorrect 
network information, even at the topological level. Until 



now, modelling has typically been made in small systems 
by the respective operators, who have detailed knowl- 
edge of their own infrastructures. Nevertheless, Ukraine 
alone transits approximately 80% of Russian gas exports 
to Europe , suggesting the presence of a strong trans- 
portation backbone crossing several European countries. 

Historically, critical infrastructure networks have 
evolved under the pressure to minimize local rather than 
global failures [l^. However, little is known on how this 
local optimization impacts network robustness and secu- 
rity of supply on a global scale. The failure of a few 
important links may cause major disruption to supply in 
the network, not because these links connect to degree 
hubs, but because they are part of major transportation 
routes that are critical to the operation of the whole net- 
work. Here we adopt the view that a robust infrastructure 
network is one which has evolved to be error tolerant to 
failures of high load links. Our method is slightly dif- 
ferent from previous work on real world critical infras- 
tructure networks with percolation theory [l^, 0, [l^ , 
which assume the simultaneous loss of many unrelated 
network components. The absence of historical records 
on the simultaneous failure of a significant percentage 
of components in natural gas networks implies that the 
methods of percolation theory are of little practical rele- 
vance in our case. Hence, our approach to the challenge 
of characterising the robustness of global transport on 
the European gas network was to characterise the hot 
transportation backbone which emerges when measuring 
network load and error tolerance. 



II. TRANS-EUROPEAN GAS NETWORKS 
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We have extracted the European gas pipeline network 
from the Platts Natural Gas geospatial data The 
data set cover all European countries (including non EU 




FIG. 1: (Colour online) European gas pipeline network. We show the transmission network [blue (dark gray) pipelines] overlaid 
with the distribution network [brown (light gray) pipelines]. Link thickness is proportional to the pipeline diameter. We 
projected the data with the Lambert azimuthal equal area projection [l^. Background colours identify EU member states. 



TABLE I: Basic network statistics for the transmission and 
complete European gas pipeline networks. The complete net- 
work is the union of the transmission and distribution net- 
works. 



Statistics 


Gas network 


Gas network 




(transmission) 


(complete network) 


Number of nodes 


2207 


24010 


Number of edges 


2696 


25554 


Total length [km] 


119961 


436289 



countries such as Norway and Switzerland) , North Africa 
(main pipelines from Morocco and Tunisia), Eastern Eu- 
rope (Belarus, Ukraine, Lithuania, Latvia, Estonia, and 
Turkey) and Western Russia (see Fig. [T]). 

Similarly to electrical power grids, gas pipeline net- 
works have two main layers: transmission and distribu- 
tion. The transmission network transports natural gas 
over long distances (typically across different countries), 



whereas pipelines at the distribution level cover urban 
areas and deliver gas directly to end consumers. We ex- 
tracted the gas pipeline transmission network from the 
complete natural gas network, as the connected compo- 
nent composed of all the important pipelines with diam- 
eter c? > 15 inches. To finalize the network, we added all 
other pipelines interconnecting major branches [l9j . We 
treated the resulting network as undirected due to the 
lack of information on the direction of flows. However, 
network links arc weighted according to pipeline diame- 
ter and length. 

The European gas pipeline infrastructure is a 
continent-wide sparse network which crosses 38 coun- 
tries, has about 2.4 x 10^ nodes [compressor stations, 
city gate stations, liquefied natural gas (LNG) termi- 
nals, storage facilities, etc.] connected by approximately 
2.5 X 10'* pipelines (including urban pipelines), span- 
ning more than 4.3 x 10^ km (see Table HI). The trans- 
European gas pipeline network is. in fact, a union of 
national infrastructure networks for the transport and 
delivery of natural gas over Europe. These networks 
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FIG. 2: (Colour online) Network of international gas trade 
movements by pipeline . Link thickness is proportional to 
the annual volume of gas traded. 



have grown under different historical, pohtical, economic, 
technological and geographical constraints, and might be 
very different from each other from a topological point of 
view. The Platts data set did not include volume or direc- 
tionality of flows. Hence, we assessed the global structure 
of the European gas network under the availability of in- 
complete information on flows. To reduce uncertainty 
on flows, we combined the physical infrastructure net- 
work with the network of international natural gas trade 
movements by pipeline for 2007 [T3| (see Fig. [5]). 

To investigate similarities among the national gas net- 
works, we first plotted in Fig. EJa) the number N of 
nodes, versus the number L of links for each country. 
Figure [3l^ a) suggests that both the transmission and the 
complete (i.e., transmission and distribution) networks 
have approximately the same average degree because all 
points fall approximately along a straight line. Indeed, 

we found (^transmission) = 2.4 and (/Ccomplete) = 2.1 Q ■ 

Surprisingly, the size of the complete European national 
gas networks ranges over three orders of magnitude from 
two nodes (former Yugoslav Republic of Macedonia) to 
10334 nodes (Germany). Further, the German transmis- 
sion network is considerably larger than the Italian net- 
work. Germany has a long history of industrial usage of 
gas and is a major hub for imports from Russia and the 
North Sea 

Given that the national networks have very different 
sizes, but approximately the same average degree, we 
looked for regularities in the probability distribution of 
degree of the European gas networks [see Fig. EJb)]. In 
accordance with previous studies of electrical transmis- 
sion networks [l3, [2l|, the complementary cumulative 
degree distribution of the transmission network decays 
exponentially as P{K > k) fa exp(— fc/A), with A = 1.44. 
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FIG. 3: (Colour online) (a) Plot of the number L of links 
versus the number A'^ of nodes for the transmission and com- 
plete (i.e. transmission and distribution) natural gas net- 
works of the countries analysed. The three largest national 
transmission and complete networks are also labeled (Ger- 
many, UK and Italy), (b) Plot of the complementary cu- 
mulative degree distribution of the European gas networks, 
together with national transmission gas networks larger than 
100 pipelines. The inset shows the degree distribution of the 
European complete and transmission networks on a double- 
logarithmic scale, highlighting the presence of fat tails on the 
degree distribution of the complete gas pipeline network. 



Unexpectedly, we found that the degree distribution of 
the complete gas network is heavy tailed, as can be seen 
in the inset of Fig. [3Kb) . This suggests that the gas net- 
work may be approximated by an exponential network 
at transmission level, but not when the distribution level 
is considered as well. The distribution network is mainly 
composed of trees which attach to nodes in the transmis- 
sion network; thus forming the complete network. Hence, 
the fat tails are a combined effect of increasing the degree 
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of existing transmission nodes in the complete network, 
and adding distribution nodes with lower degrees. 
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FIG. 4: (Colour online) Plot of the digitized Gas Transmission 
Europe (GTE) pipeline capacity versus pipeline diameter on 
a double logarithmic scale. We digitized the European gas 
network map from GTE and assigned the GTE capacities 
to pipelines in the Platts data set. The straight line is a 
regression to the data, which corresponds to c = ad°' with 
a = 2.46. 
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(b) 



Czech Republic 




The July 2007 release of the Platts data set, which 
we analysed, did not include information on the capacity 
of pipelines. To estimate the pipeline capacity, we com- 
pared cross-border flows based on capacity estimates for 
an incompressible fluid, where the capacity c can shown 
to scale as (P with pipeline diameter d (see the Ap- 
pendix), to reported cross-border flows extracted from 
the digitized Gas Transmission Europe (GTE) map p^ . 
Figure H] is a plot of averaged pipeline capacity ver- 
sus pipeline diameter in a double-logarithmic scale for 
pipelines in the GTE data set. We found a good match 
between the theoretical prediction oic d? with 7 ~ 2.6, 
and the capacity of major pipelines as made evident by 
the regression to the data c ~ with a = 2.46. Hence, 
we used the exponent 7 = 2.5 as a trade-off between the 
theoretical prediction and the numerical regression, and 
approximated c ~ d^'^. 

To understand the national structure of the network, 
we investigated the tendency of highly connected nodes 
to link to each other over high capacity pipelines. Fig- 
ure El^a) shows the Pearson correlation coefficient be- 
tween the product of the degrees of two nodes connected 
by a pipeline and the capacity of the pipeline, 

where ki and kj are the degrees of the nodes at the ends 
of pipeline e^, and Ce^^ is the capacity of the pipeline. 



FIG. 5: (Colour online) (a) Pearson correlation coefficient, 
r, for the degrees of two linked nodes and the capacity of 
the linking pipeline; and percentage of capacity on parallel 
pipelines, q. We considered only countries with more than 50 
pipelines, (b) Network layout for the Czech Republic. Link 
thickness is proportional to the pipeline capacity. 

Countries with high values of r have a gas pipeline net- 
work where degree hubs are interconnected by several 
parallel pipelines. Further, we plotted the percentage 
q of capacity on parallel pipelines for each national net- 
work. Typically, countries with high values of r also have 
high values of q. 

Figure EJa) shows relatively large differences among 
countries: Austria, the Czech Repubhc, Italy and Slo- 
vakia have both high values of r and q. A visual in- 
spection of the network in these countries uncovers the 
presence of many parallel pipelines organized along high 
capacity corridors [see Fig. E^b)]. Taken together, these 
results suggest that some European national networks 
have grown structures characterised by chains of high 
capacity (parallel) pipelines over-bridging long distances. 
This has the consequence of improving the local error tol- 
erance because the failure of one pipeline implies only a 
decrease in flow. 

Motivated by the finding of error tolerance in the gas 
pipeline networks, we then asked the question of whether 
there are global topological properties of the European 
network which could characterize the network robustness. 
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III. ANALYSIS OF MAN-MADE 
DISTRIBUTION NETWORKS WITH 
INCOMPLETE FLOW INFORMATION 



To gain insights into the overlaid infrastructure and 
aggregate flow networks, we propose two complementary 
approaches which aim at identifying the backbones of the 
overlaid networks in Figs. [1] and [2l In both approaches, 
the flow network allows an approximate estimate of the 
volume of directed flows as we wfll detail in llll Al 

In the first approach, we assume that transport oc- 
curs along the shortest paths in geographical space. We 
search for a global backbone characterised by the pres- 
ence of flow corridors where individual components were 
designed to sustain high loads. 

In the second approach, wc look at fault tolerance when 
single components fail. Recent studies of network vulner- 
ability in infrastructure networks suggest that, although 
these networks have exponential degree distributions, un- 
der random errors or attacks the size of the percolation 
cluster decreases in a way which is reminiscent of scale- 
free networks HI] . Typically, these studies presume 
that a large percentage of nodes or links may become non- 
operational simultaneously, i.e., the time scale of node or 
link failure is much faster than the time scale of repair. 
Whereas the underlying scenario of a hacker or terrorist 
attack on an infrastructure network causing large dam- 
age is certainly worth studying [2^, the consequences of 
such attacks may not be assessed properly when measur- 
ing damage by the relative size of the largest component. 
Here, we estimate the loss of flow when a single link is 
non-operational. We search for a global backbone char- 
acterised by corridors of interconnected nodes, where the 
removal of one single link causes a high loss of flow from 
source to sink nodes. 

The maximum flow problem and the shortest paths 
problem are complementary, as they capture different 
aspects of minimum cost flow. Shortest path problems 
capture link lengths, but not capacities; maximum flow 
problems model link capacities, but not lengths. 



A. Generalized betweenness centrality 



Many networks are in fact substrates, where goods, 
products, substances or materials flow from sinks to 
sources through components laid out heterogencously in 
geographical space. Examples range from supply net- 
works [2J], spatial distribution networks [25j and energy 
networks [6| to communication networks [26 1. Node and 
link stress in these networks is often characterised by 
the betweenness centrality. Consider a substrate network 
Gs — (Vs.Es) with node-set Vs and hnk-set Es- The 
betweenness centrality of link G Es is defined as the 
relative number of shortest paths between all pairs of 



nodes which pass through 



E 

s.teVs 



<^s,t 



(2) 



where Ug^t is the number of shortest paths from node s to 
node t and cr^.f (cy ) is the number of these paths passing 
through link By . The concept of betweenness centrality 
was originally developed to characterise the influence of 
nodes in social networks [13, HI] and, to our knowledge, 
was used for the first time in the physics literature in the 
context of social networks by Newman [2^ and in the 
context of communication networks by Goh et al. S^l 1^ ■ 

Betweenness centrality is relevant in man-made net- 
works which deliver products, substances or materials as 
cost constraints on these networks condition transporta- 
tion to occur along shortest paths. However, nodes and 
links with high betweenness in spatial networks are of- 
ten near the network barycentre [3l|, whose location is 
given by — Xi/N , whereas the most important in- 
frastructure elements are frequently along the periphery, 
close to either the sources or the sinks. Although flows 
are conditioned by a specific set of sources and sinks, 
the traffic between these nodes may be highly heteroge- 
neous and one may have only access to aggregate trans- 
port data, but not to the detailed flows between individ- 
ual sources and sinks (e.g. competition between opera- 
tors may prevent the release of detailed data). Here we 
propose a generalization of betweenness centrality in the 
context of flows taking place on a substrate network, but 
where flow data are available only at an aggregate level. 
We then show in the next section how the generalized 
betweenness centrality can help us to gain insights into 
the structure of trans-European gas pipeline networks. 

The substrate network is often composed of sets of 
nodes which act like aggregate sources and sinks. The 
aggregation can be geographical (e.g., countries, regions 
or cities), or organizational (e.g., companies or institu- 
tions). If the flow information is only available at aggre- 
gate level then a possible extension of the betweenness 
centrality for these networks is to weight the number of 
shortest paths between pairs of source and sink nodes by 
the amount of flow which is known to go through the 
network between aggregated pairs of sources and sinks. 
To do this, we must first create a flow network by parti- 
tioning the substrate network, Gs = (V5, i?s), into a set 
of disjoint subgraphs Vp = {{Vs^,Es^), - ■ ■ , {Vsm,Em)}- 
The flow network Gp = {Vp, Ep) is then defined as the 
directed network of fiows among the subgraphs in Vp, 
where the links Ep are weighted by the value of aggre- 
gate flow among the Vp. For our purposes, the substrate 
network is the trans-European gas pipeline network rep- 
resented in Fig. [T] and the flow network is the network of 
international gas trade movements by pipeline in Fig. 

The generalized betweenness centrality (generalized be- 
tweenness) of link eij € Es is defined as follows. Let Tk.l 
be the fiow from source subgraph K — {Vk,Ek) S Vp 
to sink subgraph L = {Vl,El) S Vp. Take each link 



bkl G Ep and compute the betweenness centrality from 
Eq. ^ of eij S Es restricted to source nodes s G Vk 
and sink nodes t G Vl. The contribution of that flow 
network hnk is then weighted by Tk.l and normahzed 
by the number of hnks in a complete bipartite graph be- 
tween nodes in Vk and Vl, 



\Vk\\Vl\ <Jst 



(3) 




B. Generalized max-flow betweenness vitality 

The maximum flow problem can be stated as fol- 
lows: In a network with link capacities, we wish to 
send as much flow as possible between two particular 
nodes, a source and a sink, without exceeding the ca- 
pacity of any link [33 |. Formally, an s-t flow network 
Gp = {Vp, Ep, s,t,c) is a digraph [46| with node-set Vp, 
link-set Ep, two distinguished nodes, a source s and a 
sink t, and a capacity function c : Ep — > Mj. A feasible 
flow is a function / : Ep — > satisfying the following 
two conditions: < f{eij) < c(eij), Ve^ S Ep (capacity 
constraints) ; and Ey.e.^eEp f i^^Ji) = E^-e.^eB^, /(^y): 
Vi G T^\{s,i} (flow conservation constraints). The maxi- 
mum s-t flow is defined as the maximum flow into the 
sink, fst{Gp) = max(X;i:e,jg£;y /(cjt)) subject to the 
conditions that the flow is feasible [H, [H, . 

We are now interested in the answer to the question: 
How does the maximum flow between all sources and 
sinks change, if we remove a link Cij from the network? 
In the absence of a detailed flow model, we calculated 
the flow that is lost when a link Cij becomes nonopera- 
tional assuming that the network is working at maximum 
capacity. In agreement with Eq. ([3]) , we define the gener- 
alized max-flow betweenness vitality [28l . [ssj (generalized 
vitality): 



E 



Tk.l K/ie^) 



(4) 



where the amount of flow which must go through link 
eij when the network is operating at maximum capac- 
ity is given by the vitality of the link [2^: A^''(ei, ) = 



fst{Gp)~ fst{Gp\e,j) 
flow in Gp. 



and fst{Gp) is the maximum s-t 




FIG. 6: (Colour online) Generalized betweenness (gray) and 
vitality (black) measures on (a) a rooted tree, (b) a modified 
rooted tree with interconnections at a chosen level, and (c) 
two communities connected by one link. Nodes are shaped 
according to their function: source nodes are squares, and sink 
nodes are circles. Both generalized betweenness and vitality 
depend on Tk.l, which is a constant for all examples. The 
smaller value of the two quantities is always drawn on the 
foreground so that both measures are visible. 



C. Generalized Betweenness Centrality versus 
Max-flow Betweenness Vitality 

A close inspection of Eq. ([3|) , generalized betweenness, 
and Eq. (j4|), generalized vitality, reveals that both mea- 
sures have the physical units of gas flow given by Tk.l- 
Further, the relative number of shortest paths crossing a 
link Cij is bounded by < < 1, and the relative 

quantity of flow which must go through the same link Cy 



is also bounded by < ^^"^^^ < 1. Thus, the general- 
ized betweenness ^ and generalized vitality ^ can be 
compared for each link. 

To examine the relationship between these two quanti- 
ties, we considered three simplified illustrative networks: 
a rooted tree where the root is the source node and all 
other nodes are sinks, the same rooted tree with addi- 
tional links interconnecting children nodes at a selected 
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FIG. 7: (Colour online) Trans-European natural gas network. Link thickness is proportional to the generalized betweenness 
centrality [see Eq. ((3] where the sets K and L are countries and the values of Tk.l are taken from the data in Fig. [2]. We 
labeled several major EU pipeline connections. The large difference between the generalized betweenness on these pipelines 
and the rest of the network suggests that the network has grown, to some extent, to transport natural gas with minimal losses 
along the shortest available routes. 



level, and two communities of source and sink nodes con- 
nected by one single link. We chose these particular ex- 
amples because they resemble subgraphs which appear 
frequently on the European gas pipeline network and thus 
they may help us to gain insights into the structure of the 
real world network. 

Both the generalized betweenness and vitality have 
the same values on the links of trees where the root is 
the source node and the other nodes are sinks. To see 
this, consider without loss of generality the case when 
Tff,L = 1. Then, the generalized betweenness of a link is 
the proportion of sinks reachable (along shortest paths) 
over the link, and the generalized vitality is the propor- 
tion of sinks fed by the link. The two quantities have the 
same value on the links of a tree and we illustrated this 
in Fig. [DJa), where we drew link thickness proportional 
to the generalized betweenness (gray) and generalized vi- 
tality (black). 

Figure [SJb) shows a modified tree network where we 



have connected child nodes at a chosen level. Here, the 
shortest paths between the root and any other node are 
unchanged from the example of the tree, but removing 
a link situated above the lateral interconnection does 
not cut all connections between the source (root) and 
sink nodes. As a consequence the values of generalized 
vitality are significantly smaller in the upper part of the 
graphs. Figure EI^c) shows two communities connected 
by one link, where source nodes are on one community 
and sink nodes on the other. This example is interesting 
for two reasons. First, the arguments used to explain 
why generalized betweenness and vitality take the same 
values on trees are also valid in this example. Second, the 
link connecting the two communities has a much higher 
value of generalized betweenness and vitality than the 
links inside the communities, which led us to expect that 
these two measures could hint at the presence of modular 
structure in the real world network. 
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FIG. 8: (Colour online) trans-European natural gas network. Link thickness is proportional to the generalized max-flow 
betweenness vitality [see Eq. (j4|, where the sets K and L are countries and the values of Tk,l are taken from the data in 
Fig. [2]. Pipelines close to the major sources tend to have high values of generalized vitality because this is where the network 
bottleneck is located. Pipelines along sparse interconnections between larger parts of the network (e.g., the Spanish-French 
border) also tend to have high values of generalized vitality, when compared to neighbouring pipelines. 



IV. NETWORK ROBUSTNESS 

The generalized betweenness measure, defined by Eq. 
dS]) , assumes that gas is transported from sinks to sources 
along the shortest paths. To investigate whether this hy- 
pothesis is correct, we plotted the European gas pipeline 
network and drew the thickness of each pipeline propor- 
tional to the value of its generalized betweenness (see 
Fig. [7]). We found that major loads, predicted by the gen- 
eralized betweenness centrality, were on the well-known 
high capacity transmission interconnections such as the 
"Transit system" in the Czech Republic, the "Eustream" 
in Slovakia, the " Yammal-Europe" crossing Belarus and 
Poland, the " Interconnector" connecting the UK with 
Belgium or the "Trans-Mediterranean" pipeline linking 
Algeria to mainland Italy through Tunisia and Sicily. 
The dramatic difference between the values of general- 
ized betweenness of all the major European pipelines and 
the rest of the network suggests that the network has 



grown to some extent to transport natural gas with min- 
imal losses along the shortest available routes between 
the sources and end consumers. These major pipelines 
are the transportation backbone of the European natural 
gas network. 

During the winter season, cross-border pipelines are 
used close to their full capacity [l9| . In this situation, 
the generalized vitality of a pipeline [Eq. (|4])] can be in- 
terpreted as the network capacity drop, or the amount 
of flow that cannot be delivered, if that pipeline becomes 
nonopcrational. The obvious drawback of the general- 
ized vitality is that it takes into account the overall ex- 
isting network capacity without considering the length 
of paths. Conversely, the generalized betweenness con- 
siders the length of shortest paths, but not the capacity 
of pipelines. Since we assess the network from two com- 
plementary view points, we expect that the results will 
allows us to get a more complete picture of the general 
properties of the European gas pipeline network. 
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Figure [8] shows the values of the generaHzed vitaHty 
in the European gas pipeHne network. We found sev- 
eral relatively isolated segments with high a generalized 
vitality located in Eastern Europe, close to the Spanish- 
French border, as well as on the south of Italy. The high 
values of the generalized vitality in Eastern Europe can 
be explained by two factors. On one hand, the gener- 
alized vitality of pipelines close to the sources is higher 
than elsewhere simply because these pipelines arc the 
bottleneck of the network. On the other hand, our ap- 
proximation that a directed link in the flow matrix im- 
plies gas flowing from all nodes in the source to all nodes 
in the sink countries was clearly coarse-grained for flows 
between Russia and the Baltic states, as it would imply 
that pipelines in southern Russia would also supply the 
Baltic countries. This highlights boundary effects on the 
calculation of bctweenness vitality, as the data set ex- 
cluded most of the Russian gas pipeline network. The 
case of the Spanish-French border was different, though. 
The link with high vitality separates the Iberian Penin- 
sula from the rest of mainland Europe. If this link was 
to be cut, then Portugal and Spain would only be linked 
to the pipeline network through Morocco. Finally, the 
south of Italy highlights an interesting example of two 
communities (Europe and North Africa) separated by 
the Trans-Mediterranean pipeline, which is reminiscent 
of the example in Fig. [Hfc) . 

Perhaps surprisingly, we found that the generalized vi- 
tality is more or less homogeneous in most of mainland 
Europe. This result suggests that the EU gas pipeline 
network has grown to be error tolerant and robust to the 
loss of single links. 

Distribution networks originate from the need for an 
effective connectivity among sources and sinks (36l . [37| . 
For example, a spanning tree is highly efficient as it trans- 
ports goods from sinks to sources in a way that shortens 
the total length of the network, thereby increasing its 
efficiency and viability. If the European gas pipeline net- 
work had been built as a spanning tree, its links would 
have very similar values of generalized betweenness and 
max-flow vitality (see Fig. [6]). 

The values of the generalized betweenness are consider- 
ably higher than the corresponding values of generalized 
vitality for the most important pipelines in the European 
Union. In other words, the major pipelines are crossed 
by many shortest paths, but a nonoperational pipeline 
causes only a minor capacity drop in the network. This 
dramatic contrast between the two measures reveals a 
hot backbone [38| showing that the trans-European gas 
pipeline network is robust, i.e. error tolerant to failures 
of high load links. 



V. CONCLUSIONS 

We analysed the trans-European gas pipeline network 
from a topological point of view. We found that the Euro- 
pean national gas pipeline networks have approximately 



the same value of average node degree, even if their sizes 
vary over three orders of magnitude. Like the electri- 
cal power grid, the degree distribution of the European 
gas transmission network decays exponentially. Unex- 
pectedly, the degree distribution of the complete (trans- 
mission and distribution) gas pipeline network is heavy 
tailed. In some countries which arc crucial for the transit 
of gas in Europe (Austria, the Czech Republic, Italy and 
Slovakia), we found that the main gas pipelines are or- 
ganized along high capacity corridors, where capacity is 
split among two or more pipelines which run in parallel, 
over-bridging long distances. This implies that the net- 
work is error tolerant because the failure of one pipeline 
causes only a decrease in flow. Motivated by the finding 
of error tolerance in national networks, we then addressed 
the problem of capturing the topological structure of the 
European gas network. 

At a global scale, the growth of the European gas 
pipeline network has been determined by two compet- 
ing mechanisms. First, the network has grown under 
cost and efficiency constraints to minimize the length 
of transport routes and maximize transported volumes. 
Second, the network has developed error tolerance by 
adding redundant links. The combination of the two 
mechanisms guarantees that the European gas pipeline 
network is robust, i.e. error tolerant to failures of high 
load links. To reveal the network robustness, we analysed 
two measures — the generalized betweenness and general- 
ized vitality — which highlight global backbones of trans- 
port efficiency and error tolerance, respectively. Finally, 
we proposed that the hot backbone of the network is the 
skeleton of major transport routes where the network is 
robust, in other words, where values of generalized be- 
tweenness are high and values of generalized vitality are 
low. Our method is of potential interest as it provides a 
detailed geographical analysis of engineered distribution 
networks. 

Further research in continent-wide distribution net- 
works could proceed along several directions. The op- 
timality of existing networks and the existence of scal- 
ing laws could be approached from a theoretical perspec- 
tive [4^ . l43j . Planned and under-construction pipelines 
may change the robustness of the network, in particular 
within their geographical vicinity. Liquefied natural gas 
(LNG), which is nowadays transported at low cost be- 
tween continents, is increasingly supplying the pipeline 
network. The combined effect of LNG and storage fa- 
cilities throughout the European coastline has the po- 
tential to reduce the dependency on one single export- 
ing country, such as Russia. Last, but not the least, 
the dispute between Russia and Ukraine in January 2009 
has brought supply security to the top of the European 
political agenda and highlighted how the European gas 
network is robust to engineering failures, yet fragile to 
geopolitical crises. 
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APPENDIX: PIPELINE CAPACITY 

The capacity of a pipeline can be schematically derived 
as follows: It is known that the flow of an incompress- 
ible viscous fluid in a circular pipe can be described in 
the laminar regime (with a parabolic velocity profile) by 
the Hagen-Poiseuille equation [3§| , which states that the 
volume of fluid passing per unit time is 



dV/dt = Ti/^pr^/ {8ril) , 



(A.l) 



where Ap is the pressure difference between the two ends 
of the pipeline, / is the length of the pipeline (thus, —Ap/l 
is the pressure gradient), rj is the dynamic viscosity, and 
r is the radius of the pipeline. However, the gas net- 
work operates in the turbulent regime, and the Hagen- 



Poiseuille equation is no longer valid. Therefore, we ap- 
ply the Darcy-Weisbach equation for pipeline head loss, 
h f . This is a phenomenological equation which describes 
the loss of energy due to friction within the pipeline and 
is valid in the laminar and turbulent regimes [40| : 



2gd' 



(A.2) 



where / is called the Darcy friction factor, d is the 
pipeline diameter, v is the average velocity, and g is the 
acceleration of gravity. Equation (|A.2[) can be written as 

a function of the volumetric flow rate dV/dt = tt (5) v 
(which is the capacity of the pipehne [4l|), as 



16 f I {dV/dtf 



(A.3) 



In general, the friction factor / and the pipeline loss hf 
depend on the pipeline diameter d, so the capacity of the 

pipeline is given by c = dV/dt = f (77) d^^^ d'' ^ 
where typically 7 ~ 2.6 for gas pipelines plj . 
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